Contextualized ranking of entity types based on knowledge graphs

نویسندگان

  • Alberto Tonon
  • Michele Catasta
  • Roman Prokofyev
  • Gianluca Demartini
  • Karl Aberer
  • Philippe Cudré-Mauroux
چکیده

A large fraction of online queries target entities. For this reason, Search Engine Result Pages (SERPs) increasingly contain information about the searched entities such as pictures, short summaries, related entities, and factual information. A key facet that is often displayed on the SERPs and that is instrumental for many applications is the entity type. However, an entity is usually not associated to a single generic type in the background knowledge graph but rather to a set of more specific types, which may be relevant or not given the document context. For example, one can find on the Linked Open Data cloud the fact that Tom Hanks is a person, an actor, and a person from Concord, California. All these types are correct but some may be too general to be interesting (e.g., person), while other may be interesting but already known to the user (e.g., actor), or may be irrelevant given the current browsing context (e.g., person from Concord, California). In this paper, we define the new task of ranking entity types given an entity and its context. We propose and evaluate new methods to find the most relevant entity type based on collection statistics and on the knowledge graph structure interconnecting entities and types. An extensive experimental evaluation over several document collections at different levels of granularity (e.g., sentences, paragraphs) and different type hierarchies (including DBpedia, Freebase, and schema.org) shows that hierarchy-based approaches provide more accurate results when picking entity types to be displayed to the end-user.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Contextualizing Grammar Instruction through Meaning-Centered Planned Pre-emptive Treatment and Enhanced Input in an EFL Context

This study has aimed to compare the effects of two types of form-focused instruction, i.e. de-contextualized focus-on-forms instruction versus meaning-centered contextualized focus-on-form instruction, on the development of grammatical knowledge of Iranian high-school students. Two groups of male high-school first graders participated in this study.  One group was taught through de-contextualiz...

متن کامل

Scalable Link-based Personalization for Ranking in Entity-Relationship Graphs

Authority flow techniques like PageRank and ObjectRank can provide personalized ranking of typed entity-relationship graphs. There are two main ways to personalize authority flow ranking: Nodebased personalization, where authority originates from a set of userspecific nodes; Edge-based personalization, where the importance of different edge types is user-specific. We propose for the first time ...

متن کامل

ESearch: Incorporating Text Corpus and Structured Knowledge for Open Domain Entity Search

The paper introduces an open domain entity search system called ESearch, which aims at finding a list of relevant entities to an open domain entity search query (a natural language question). The system is built on top of a Wikipedia text corpus, as well as the structured DBPedia knowledge base. Entities are initially ranked by a model which effectively associates context matching (based on the...

متن کامل

Learning to rank related entities in Web search

Entity ranking is a recent paradigm that refers to retrieving and ranking related objects and entities from different structured sources in various scenarios. Entities typically have associated categories and relationships with other entities. In this work, we present an extensive analysis of Web-scale entity ranking, based on machine learned ranking models using an ensemble of pair-wise prefer...

متن کامل

Learning Parameters in Entity Relationship Graphs from Ranking Preferences

Semi-structured entity-relation (ER) data graphs have diverse node and edge types representing entities (paper, person, company) and relations (wrote, works for). In addition, nodes contain text snippets. Extending from vector-space information retrieval, we wish to automatically learn ranking function for searching such typed graphs. User input is in the form of a partial preference order betw...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. Web Sem.

دوره 37-38  شماره 

صفحات  -

تاریخ انتشار 2016